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Abstract — Telemonitoring of electroencephalogram (EEG) 
through wireless body-area networks is an evolving direction 
in personalized medicine. Among various constraints in de- 
signing such a system, three important constraints are energy 
consumption, data compression, and device cost. Conventional 
data compression methodologies, although effective in data com- 
pression, consumes significant energy and cannot reduce device 
cost. Compressed sensing (CS), as an emerging data compres- 
sion methodology, is promising in catering to these constraints. 
However, EEG is non-sparse in the time domain and also non- 
sparse in transformed domains (such as the wavelet domain). 
Therefore, it is extremely difficult for current CS algorithms to 
recover EEG with the quality that satisfies the requirements of 
clinical diagnosis and engineering applications. Recently, Block 
Sparse Bayesian Learning (BSBL) was proposed as a new method 
to the CS problem. This study introduces the technique to 
the telemonitoring of EEG. Experimental results show that its 
recovery quality is better than state-of-the-art CS algorithms, and 
sufficient for practical use. These results suggest that BSBL is 
very promising for telemonitoring of EEG and other non-sparse 
physiological signals. 

Index Terms — Telemonitoring, Healthcare, Wireless Body- 
Area Network (WBAN), Compressed Sensing (CS), Block Sparse 
Bayesian Learning (BSBL), electroencephalogram (EEG) 



I. Introduction 

Telemonitoring of electroencephalogram (EEG) via Wire- 
less Body- Area Networks (WBANs) is an evolving direction 
in personalized medicine and home-based e-Health. In such 
a system, a WBAN [1] integrates a number of sensors which 
collect and compress EEG. The compressed EEG is sent to 
a nearby smart-phone via ultra-low-power short-haul radios 
(e.g., Bluetooth), and then is transmitted to a remote terminal 
(e.g., a hospital) via the Internet. In the terminal, the original 
EEG is recovered by a computer. Equipped with the system, 
patients need not visit hospitals frequently. Instead, their EEG 
can be monitored continuously and ubiquitously. 

However, there are many constraints that have to be taken 
into account when designing such a system. The primary one 
is energy constraint [2]. Due to limitation on battery life, it is 
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necessary to reduce energy consumption as much as possible. 
Low energy consumption means that a system can use small 
and light batteries and sensors. Consequently, the light weight 
of the device can significantly improve the comfort level 
of patients. Besides, low energy consumption means longer 
battery and sensor lifetime, which reduces operational costs 
of the system. 

Another constraint is that transmitted physiological signals 
should be largely compressed. This is because the communi- 
cation capacity of ultra-low-power short-haul radio devices is 
low and can be stressed especially in some applications using 
multiple- sensors or high-speed sampling frequency. Besides, 
a WBAN generally uses a smart-phone as an intermediate 
transit point. Thus, it is important that data stream does not 
overwhelm the smart-phone, disturbing its primary functions 
such as receiving and making phone calls, playing games, and 
other mobile-based applications. 

The third constraint is hardware costs. Low hardware costs 
are more likely to make a telemonitoring system economi- 
cally viable and accepted by individual customers. However, 
low hardware costs mean that data compression (on sensors) 
should have low complexity and data recovery (in remote ter- 
minals) should not require sensors to pre-process raw signals 
when collecting them. 

It is noted that many conventional data compression 
methodologies such as wavelet compression cannot satisfy all 
the above constraints at the same time. It has been shown 
in [3] that compared to wavelet compression, Compressed 
Sensing (CS), when using sparse binary matrices as its sensing 
matrices, can reduce energy consumption while achieving 
competitive data compression ratio. Besides, the use of sparse 
binary matrices means the device cost can be largely reduced 
[3], [4]. However, current CS algorithms only work well for 
sparse signals or signals with sparse representation coefficients 
in some transformed domains (e.g., the wavelet domain). Since 
EEG is neither sparse in the original time domain nor sparse 
in transformed domains, current CS algorithms cannot achieve 
good recovery quality. 

To recover EEG signals with high quality that satisfies the 
needs of practical applications, this study proposes using Block 
Sparse Bayesian Learning (BSBL) [5], [6] to compress/recover 
EEG. The BSBL framework was initially proposed for signals 
with block structure [5] and has been successfully used for 
the telemonitoring of fetal ECG [7], which has obvious block 
structure. This study explores the feasibility of using the BSBL 
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technique for EEG, which is an example of a signal with an 
arbitrary waveform and without distinct block structure. The 
fidelity of recovered EEG signal is assessed by subsequent 
signal processing such as independent component analysis. 

The rest of the paper is organized as follows. Section HI] 
briefly introduces the CS model and the BSBL framework. 
Section (TTTJ presents the results. The last two sections discuss 
the results and conclude the paper. 

II. Compressed Sensing and Block Sparse Bayesian 
Learning 

Compressed Sensing (CS) [8] is a new data compression 
paradigm, in which a signal of length TV, denoted by x £ 
R iYxl , is compressed by a full row-rank random matrix, 
denoted by $ e R MxN (M <C N, Rank($) = M), i.e., 

y = *x, (1) 

where y is the compressed data, and is called the sensing 
matrix, which is known to CS algorithms for recovery. CS 
algorithms use the compressed data y and the sensing matrix 
3? to recover the original signal x. Their successes rely on the 
key assumption that most entries of the signal x are zero (i.e., 
x is sparse). When this assumption does not hold, one can 
seek a dictionary matrix, denoted by D G R MxM , so that x 
can be expressed as x = Dz and z is sparse. Then, the model 
0} can be re-written as 

y = $Dz. (2) 

Thus, CS algorithms can first recover z using y and 3>D, and 
then recover the original signal x by x = Dz. 

When CS is used in a telemonitoring system, signals are 
compressed on sensors according to (HJ. This compression 
stage consumes on-chip energy of the WBAN. The signals 
are recovered by a remote computer according to (0, where 
the matrix <3> is known to a CS algorithm and the matrix D is 
determined by a user. This stage does not consume any energy 
of the WBAN. 

CS has several advantages over wavelet compression. In 
[3], [7] it is shown that when the sensing matrix $ is a 
sparse binary matrix, in which most entries are zeros and 
only few entries are ones, CS algorithms cost less energy 
but have competitive compression ratio compared to wavelet 
compression. For example, when compressing a signal of 
length N, CS left-multiplies it by an M x N(M <C N) 
sparse binary matrix. Then amplitudes of the M compressed 
data are coded. In contrast, in wavelet compression this 
signal is first left-multiplied by an N x TV wavelet transform 
matrix with real entries. The wavelet coefficients with large 
amplitudes are found by a search algorithm, and both their 
amplitudes and locations are coded. Thus, CS requires fewer 
code execution on CPU. A second advantage of CS is that its 
compression operator greatly facilitates hardware design, since 
the implementation of multiplication with a binary matrix 
needs only accumulator registers. In wavelet compression, the 
compression involves multiplications of real numbers, which 
cannot be implemented by merely accumulator registers. 

Despite of these advantages, the use of CS in telemonitoring 
is only limited to a few types of signals, mainly because 



most physiological signals like EEG are not sparse in the time 
domain and not sparse enough in transformed domains. The 
issue now can be solved by the BSBL framework [5]. 

The BSBL framework was initially proposed for recovering 
a signal with block structure [5], [6]. It assumes the signal 
x can be partitioned into a concatenation of non-overlapping 
blocks, and a few of blocks are non-zero. Thus, it requires 
users to define the block partition of x. However, it turns 
out that such user-defined block partition does not need to 
be consistent with the true block partition of the signal [7]; 
in fact, the user-defined block partition can be viewed as a 
regularization for the estimation of the signal's co variance 
matrix. Further, in this work we found even if a signal has no 
distinct block structure, the BSBL framework is still effective. 
This makes feasible using BSBL for the CS of EEG and 
adopting the model for recovery, since EEG has arbitrary 
waveforms and the representation coefficients z generally lack 
block structure (see Fig ID. 

The BSBL framework has a pruning mechanism, which 
prunes out blocks in x (or in z if using the model ©) when the 
blocks have very small norms. However, EEG is non- sparse 
in both the time domain and transformed domains. Therefore, 
we disabled the pruning mechanism in our experiments. 

Currently, there are three algorithms in the BSBL frame- 
work. In our experiments we chose a bound-optimization 
based algorithm, denoted by BSBL-BO. Details on the algo- 
rithm and the BSBL framework can be found in [5]. 

III. Experiments of Compressed Sensing of EEG 

The following experimental compared BSBL-BO with some 
representative CS algorithms in terms of recovery quality. Be- 
cause all the CS algorithms adopted the same sensing matrix, 
they had equal energy consumption. Thus, the comparison of 
energy consumption is excluded. 

Two performance indexes were used to measure recovery 
quality. One was the Normalized Mean Square Error (NMSE), 
defined as ||x — x|||/ where x was the estimate of 
the true signal x. The second was the Structural SIMilarity 
index (SSIM) [9] for 1 -dimensional signals (the length of 
the sliding window was 100). SSIM measures the similarity 
between the recovered signal and the original signal, which 
is a better performance index than the NMSE for structured 
signals. Higher SSIM means better recovery quality. When the 
recovered signal is the same as the original signal, SSIM = 1. 

The following experiments used the model © to recover 
EEG. In the first experiment D was an inverse Discrete 
Cosine Transform (DCT) matrix, and thus z (z = D _1 x) are 
DCT coefficients. In the second experiment D was an inverse 
Daubechies-20 Wavelet Transform (WT) matrix, which was 
suggested in [4] for compressing EEG. In both experiments 
the sensing matrices <E> were sparse binary matrices, in which 
every column contained 15 entries equal to 1 with random 
locations while other entries were zeros. For BSBL-BO, 
we defined a block partition, where the starting location of 
each block was incremented by 24 (i.e., 1, 25, 49, • • • ). The 
maximum number of iterations for BSBL-BO was set to 7. 

Experiment codes can be downloaded at: 

https : //sites . google . com/ site /re sear chbyzhang/bsbl 
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Fig. 1. (a) An EEG epoch, and its DCT coefficients, (b) The recovery results 
by BSBL-BO, h, and Model-CoSaMP when using the model 0. 

TABLE I 

Averaged performance in Experiment 1 . 





NMSE (mean ± 


SSIM (mean ± 




std) 


std) 


DCT-based BSBL-BO 


0.078 ± 0.046 


0.85 ± 0.08 


BSBL-BO without DCT 


0.116 ± 0.066 


0.81 ± 0.09 


DCT-based h 


0.493 ± 0.121 


0.48 ±0.11 


DCT-based Block-CoSaMP 


0.434 ± 0.070 


0.45 ± 0.10 



A. Experiment 1: Compressed Sensing with DCT 

This example used a common dataset ('eeglab_data.set') in 
the EEGLab [10] to mimic the telemonitoring scenario by first 
compressing it and then recovering it. This dataset contains 
EEG signals of 32 channels with sequence length of 30720 
data points, and each channel signal contains 80 epochs each 
containing 384 points. Artifacts caused by muscle movement 
are also contained in the signals. 

To compress the signals epoch by epoch, we used a 192 x 
384 sparse binary matrix as the sensing matrix <I>, and a 384 x 
384 inverse DCT matrix as the dictionary matrix D. 

Two representative CS algorithms were compared in this 
experiment. One was the Model-CoSaMP [11], which has high 
performance for signals with known block structure. Here it 
used the same block partition as BSBL-BO. The second was 
an t\ algorithm used in [4] to recover EEG. The parameters 
of the two algorithms were tuned for optimal results. 

Figure [Ha) shows an EEG epoch and its DCT coefficients. 
Clearly, the DCT coefficients were not sparse and had no 
block structure. Figure Q2b) shows the recovery results of the 
three algorithms. Only BSBL-BO recovered the epoch with 
good quality; characteristic EEG peaks/troughs and oscillatory 
activities were accurately presented in the recovered signal. 
Table U shows the averaged NMSE and SSIM of the three 
algorithms on the whole dataset. It also lists the results when 
BSBL-BO directly recovered the signals without using the 
dictionary matrix (i.e., using the model (0Q)). The DCT-based 
BSBL-BO evidently had the best performance, and it took 
0.105 second per epoch on average on a computer with 2.8G 
CPU and 6G RAM. BSBL-BO without using the dictionary 
matrix took 0.271 second per epoch on average. 

In EEG analysis, a regular methodology is performing 
Independent Component Analysis (ICA) on scalp EEG data 
and then analyzing single-trial ERPs for each Independent 
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Fig. 2. An IC with focal back-projected scalp distribution derived (a) from 
the original EEG dataset and (b) from the recovered dataset. Another IC 
with dispersive scalp distribution derived (c) from the original EEG dataset 
and (d) from the recovered dataset. Each subfigure shows the back- projected 
scalp map, the ERP image, and the averaged ERP of an IC. 



Component (IC) [12]. Therefore, it is important to examine 
whether the obtained ICs from the recovered EEG dataset by 
BSBL-BO are the same as those from the original dataseH. 

This study performed ICA decomposition on the original 
EEG dataset and the recovered EEG dataset by BSBL-BO, 
respectively, using the Extended-Infomax algorithm with the 
same initialization, which is a build-in program in the EEGLab 
[10]. Then, we calculated the back-projected scalp map, the 
ERP image [12], and the averaged ERP of each IC from the 
original dataset and the reconstructed dataset. 

Figure [2] shows the results of two typical ICs (with large 
energy) from the recovered dataset (Figure [2] (b)(d)), and the 
results of corresponding ICs from the original dataset (Figure [2] 
(a)(c)). Comparing Figure [2] (a) with (b) and Figure [2] (c) with 
(d) reveals that there is little difference in terms of scalp maps, 
ERP images, and averaged ERPs. This implies that BSBL-BO 
can recover EEG signals with satisfactory quality, ensuring 
subsequent signal analysis with high fidelity. 

B. Experiment 2: Compressed Sensing with WT 

The second experiment used the dataset in [13]. It consists 
of multiple channel signals, each channel signal containing 
250 epochs for each of two events ('left direction' and 'right 
direction'). Each epoch consists of 256 sampling points. The 
goal in [13] is to differentiate the averaged ERP for the 'left 
direction' with the averaged ERP for the 'right direction'. For 
simplicity, we randomly chose a channel signal from the left 
parietal area. BSBL-BO and the previous t\ algorithm were 
compared. The sensing matrix <I> had the size of 128 x 256, 
and the dictionary matrix D had the size of 256 x 256. 

2 We only need to pay attention to the ICs with large energy, since in regular 
ICA analysis of EEG, ICs with large energy are reliable and meaningful. 
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Fig. 3. The ERPs corresponding to two event conditions ('left' and 'right') 
averaged (a) from the recovered epochs by the l\ algorithm, (b) from the 
recovered epochs by BSBL-BO, and (c) from the original dataset. 

For each event, we calculated the ERP by averaging the 
associated 250 recovered epochs. Figure 0(a) shows the ERP 
for the 'left direction' and the ERP for the 'right direction' 
averaged from the dataset recovered by the £i algorithm. 
Figure [3] (b) shows the two ERPs averaged from the recovered 
dataset by BSBL-BO. Figure [3] (c) shows the averaged ERPs 
from the original dataset (called genuine ERPs). Clearly, the 
resulting ERPs by the i\ algorithm were noisy. Although they 
maintained the main peaks of both genuine ERPs, they did not 
maintain other details of the genuine ERPs. Particularly, the 
difference between the two resulting ERPs from the 160 th to 
the 250 th time points was not clear. Besides, we found there 
were many brief oscillatory bursts in the recovered epochs by 
the t\ algorithm (due to space limit we omit the results here). 
In contrast, the ERPs averaged from the recovered epochs by 
BSBL-BO maintained all the details of the genuine ERPs with 
high fidelity. 

The SSIM and the NMSE of the resulting ERPs by the t x 
algorithm were 0.92 and 0.044, respectively. In contrast, the 
SSIM and the NMSE of the resulting ERPs by BSBL-BO were 
0.97 and 0.008, respectively. In the experiment BSBL-BO took 
0.06 second per epoch on average on the previous computer. 

IV. Discussions 

CS often resorts to a dictionary matrix to recover a non- 
sparse signal. However, the success of this approach heavily 
relies on the sparsity of its representation coefficients under the 
dictionary matrix. Therefore, finding a dictionary matrix under 
which a signal can be sparsely represented is very important. 
However, finding such a dictionary matrix for many physiolog- 
ical signals is challenging. We found that using various popular 
dictionary matrices, the representation coefficients of EEG 
signals are still not sparse. Therefore, current CS algorithms 
have poor performance, and their recovery quality is not suit- 
able for many clinical applications and cognitive neuroscience 
studies. Instead of seeking optimal dictionary matrices, this 
study proposed a method using general dictionary matrices 
yet achieving sufficient recovery quality for typical cognitive 
neuroscience studies. The empirical results suggest that when 
using the BSBL framework for EEG compression/recovery, 
the seeking of optimal dictionary matrices is not very crucial. 

Notably, if energy consumption is not an issue, using 
wavelet compression for EEG can result in better recovery 
quality than any CS algorithms including BSBL-BO. For ex- 
ample, in the second experiment when transmitting 128 largest 



wavelet coefficients (using the same wavelet transform), we 
obtained the ERPs with higher quality: the SSIM and the 
NMSE were 0.99 and 0.0003, respectively. Clearly, if energy 
consumption is not a problem, wavelet compression may be 
a better choice. But sometimes higher recovery quality is 
not necessarily required for practical applications. Considering 
the ERP analysis in the second experiment, the recovery 
quality by BSBL-BO satisfied the requirement of the ERP 
analysis, and thus higher recovery quality is not needed or 
attractive, especially at the cost of more energy consumption. 
Of course, the choice between CS and wavelet compression (or 
other compression techniques) for the telemonitoring of EEG 
probably differs case by case and thus needs further study. 

V. Conclusions 

Compressing EEG for telemonitoring is extremely difficult 
for current CS algorithms, because EEG is not sparse in the 
time domain nor sparse in transformed domains. To alleviate 
the problem, this study proposed to use the framework of block 
sparse Bayesian learning, which has superior performance to 
other existing CS algorithms in recovering non-sparse signals. 
Experimental results showed that it recovered EEG signals 
with good quality, ensuring subsequent signal analysis. Thus, 
it is very promising for wireless telemonitoring based cognitive 
neuroscience studies and engineering applications. 
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